Refactor(FindSimiliar): MilvusCache to use Milvus Search API #352
If the code changes are fine, I can add an integration test for the Milvus cache. Please advise if my code is incorrect; I'm open to criticism.
@srini-abhiram this is cool! Can you sign the DCO?
Replaces manual similarity calculation and query-based retrieval in FindSimilar with Milvus's Search API for more efficient and accurate similarity search. Updates index creation to use the new HNSW index API. Improves cache hit/miss logic and error handling. Signed-off-by: Srinivas A <[email protected]>
dbe4332 to 38ffba0
@rootfs I have followed your instructions and signed the commit.
looks good, thanks!
@rootfs I haven't added the integration test case for Milvus Search yet; I am working on it. Should I create a separate PR when I'm done?
sure, plz go ahead in a separate PR, thanks! @srini-abhiram
What type of PR is this?
refactor(FindSimilar): Migrate to Milvus for similarity search
What this PR does / why we need it:
This PR refactors the FindSimilar functionality to use the Milvus vector database for similarity search, replacing the previous manual calculation and query-based retrieval logic.
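For context, a manual similarity calculation of the kind described above might look roughly like the following. This is a minimal sketch, not the code removed by this PR: `cacheEntry`, `cosine`, and `bruteForceFindSimilar` are hypothetical names, and it assumes cosine similarity with a fixed threshold. Every lookup scans all entries, which is the cost a vector-database search is meant to avoid.

```go
package main

import "math"

// cacheEntry is a hypothetical stand-in for one cached query/response pair.
type cacheEntry struct {
	Embedding []float32
	Response  string
}

// cosine returns the cosine similarity of a and b. It returns 0 when the
// lengths differ, the vectors are empty, or either vector has zero norm.
func cosine(a, b []float32) float64 {
	if len(a) != len(b) || len(a) == 0 {
		return 0
	}
	var dot, normA, normB float64
	for i := range a {
		dot += float64(a[i]) * float64(b[i])
		normA += float64(a[i]) * float64(a[i])
		normB += float64(b[i]) * float64(b[i])
	}
	if normA == 0 || normB == 0 {
		return 0
	}
	return dot / (math.Sqrt(normA) * math.Sqrt(normB))
}

// bruteForceFindSimilar scores the query against every entry and reports a
// hit only if the best score reaches the threshold. Cost is O(n*d) per call
// for n entries of dimension d.
func bruteForceFindSimilar(entries []cacheEntry, query []float32, threshold float64) (string, bool) {
	best := math.Inf(-1)
	bestIdx := -1
	for i, e := range entries {
		if s := cosine(e.Embedding, query); s > best {
			best, bestIdx = s, i
		}
	}
	if bestIdx < 0 || best < threshold {
		return "", false
	}
	return entries[bestIdx].Response, true
}
```

Calling `bruteForceFindSimilar(entries, queryEmbedding, 0.8)` returns the cached response and `true` on a hit, or an empty string and `false` on a miss.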
Key changes include:
- Adopting Milvus Search API: All similarity search operations now leverage Milvus's native Search API, which is highly optimized for performance and accuracy.
- HNSW Indexing: The index creation process has been updated to use the new HNSW (Hierarchical Navigable Small World) index API, which provides faster and more accurate search results for large-scale vector data.
- Code Improvements: The caching logic has been streamlined, and error handling for interactions with the Milvus service has been made more robust.
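The hit/miss and error-handling flow described in the changes above can be sketched independently of the Milvus SDK. Everything here is an assumption made for illustration: `vectorSearcher`, `searchHit`, `findSimilar`, and `stubSearcher` are hypothetical names rather than types from this PR or from the Milvus client, and the sketch assumes a metric where a higher score means a closer match.

```go
package main

import (
	"errors"
	"fmt"
)

// searchHit is a hypothetical, SDK-independent view of one search result.
// Assumption: a higher Score means a more similar vector.
type searchHit struct {
	Response string
	Score    float32
}

// vectorSearcher is a hypothetical abstraction over the search backend.
type vectorSearcher interface {
	Search(query []float32, topK int) ([]searchHit, error)
}

var errEmptyQuery = errors.New("empty query embedding")

// findSimilar asks the backend for the single nearest neighbour and treats
// it as a cache hit only when its score reaches the threshold. Backend
// failures are returned as errors instead of being reported as misses.
func findSimilar(s vectorSearcher, query []float32, threshold float32) (string, bool, error) {
	if len(query) == 0 {
		return "", false, errEmptyQuery
	}
	hits, err := s.Search(query, 1)
	if err != nil {
		return "", false, fmt.Errorf("vector search failed: %w", err)
	}
	if len(hits) == 0 || hits[0].Score < threshold {
		return "", false, nil // cache miss
	}
	return hits[0].Response, true, nil // cache hit
}

// stubSearcher is an in-memory fake used only to exercise findSimilar.
type stubSearcher struct {
	hits []searchHit
	err  error
}

func (s stubSearcher) Search(_ []float32, topK int) ([]searchHit, error) {
	if s.err != nil {
		return nil, s.err
	}
	if topK < len(s.hits) {
		return s.hits[:topK], nil
	}
	return s.hits, nil
}
```

Keeping the backend behind a small interface also makes the hit/miss logic testable without a running Milvus instance, which may be useful for the integration test discussed in the conversation.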
This migration was necessary to improve the efficiency, scalability, and accuracy of our similarity search feature, and to reduce the maintenance overhead of the custom-built similarity logic written in Go.
Which issue(s) this PR fixes:
Fixes #150
Release Notes: No